Biomedical Named Entity Recognition at Scale


Abstract

Named entity recognition (NER) is a widely applicable natural language processing task and a building block of question answering, topic modeling, information retrieval, etc. In the medical domain, NER plays a crucial role by extracting meaningful chunks from clinical notes and reports, which are then fed to downstream tasks like assertion status detection, entity resolution, relation extraction, and de-identification. Reimplementing a Bi-LSTM-CNN-Char deep learning architecture on top of Apache Spark, we present a single trainable NER model that obtains new state-of-the-art results on seven public biomedical benchmarks without using heavy contextual embeddings like BERT. This includes improving BC4CHEMD to 93.72% (4.1% gain), Species800 to 80.91% (4.6% gain), and JNLPBA to 81.29% (5.2% gain). In addition, this model is freely available within a production-grade code base as part of the open-source Spark NLP library; can scale up for training and inference in any Spark cluster; has GPU support and libraries for popular programming languages such as Python, R, Scala and Java; and can be extended to support other human languages with no code changes.
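
To make "extracting meaningful chunks" concrete, the following is a minimal sketch of how per-token NER tags might be grouped into entity chunks for downstream tasks. It assumes the common BIO tagging scheme, which the abstract does not specify; the function name `bio_to_chunks`, the example sentence, and the label names (`CHEMICAL`, `PROTEIN`, `SPECIES`) are hypothetical and are not taken from the paper or from Spark NLP.

```python
from typing import List, Tuple


def bio_to_chunks(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (label, text) entity chunks.

    Assumption: tags follow the BIO scheme ("B-X" opens an entity of
    type X, "I-X" continues it, "O" is outside any entity).
    """
    chunks: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_label = None

    def close_chunk() -> None:
        # Emit the open chunk, if any, and reset the state.
        nonlocal current_tokens, current_label
        if current_tokens:
            chunks.append((current_label, " ".join(current_tokens)))
        current_tokens, current_label = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new entity starts; close whatever was open before it.
            close_chunk()
            current_tokens, current_label = [token], tag[2:]
        elif tag.startswith("I-") and current_label == tag[2:]:
            # Continuation of the currently open entity.
            current_tokens.append(token)
        else:
            # "O", or an "I-" tag that does not continue the open chunk:
            # close the open chunk and skip this token.
            close_chunk()

    close_chunk()
    return chunks


# Hypothetical example sentence and labels, for illustration only.
tokens = ["Aspirin", "inhibits", "cyclooxygenase", "-", "2",
          "in", "Homo", "sapiens", "."]
tags = ["B-CHEMICAL", "O", "B-PROTEIN", "I-PROTEIN", "I-PROTEIN",
        "O", "B-SPECIES", "I-SPECIES", "O"]
chunks = bio_to_chunks(tokens, tags)
```

In this sketch a stray `I-` tag with no matching open entity is simply dropped; a production system might instead treat it as the start of a new chunk.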


Related resources

Biomedical Named Entity Recognition System

We propose a machine learning approach, using a Maximum Entropy (ME) model to construct a Named Entity Recognition (NER) classifier to retrieve biomedical names from texts. In experiments, we utilize a blend of various linguistic features incorporated into the ME model to assign class labels and location within an entity sequence, and a postprocessing strategy for corrections to sequences of ta...


At-least-N voting over biomedical named entity recognition systems

Biomedical named entity recognition (BNER) has been actively studied over the years, and several BNER systems have become publicly available. In this study, we investigate the utility of a simple voting method called at-least-n voting to improve gene name recognition, which takes advantage of the availability of BNER systems in the domain. We found this voting scheme is effective in combining B...


Biomedical Named Entity Recognition Using Neural Networks

We investigate the task of Named Entity Recognition (NER) in the domain of biomedical text. There is little published work employing modern neural network techniques in this domain, probably due to the small sizes of human-labeled data sets, as non-trivial neural models would have great difficulty avoiding overfitting. In this work we follow a semi-supervised learning approach: We first train s...


Reranking for Biomedical Named-Entity Recognition

This paper investigates improvement of automatic biomedical named-entity recognition by applying a reranking method to the COLING 2004 JNLPBA shared task of bioentity recognition. Our system has a common reranking architecture that consists of a pipeline of two statistical classifiers which are based on log-linear models. The architecture enables the reranker to take advantage of features which...



Journal

Journal title: Lecture Notes in Computer Science

Year: 2021

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-030-68763-2_48